Robustness of methods for blinded sample size re-estimation with overdispersed count data.

نویسندگان

  • Simon Schneider
  • Heinz Schmidli
  • Tim Friede
چکیده

Counts of events are increasingly common as primary endpoints in randomized clinical trials. With between-patient heterogeneity leading to variances in excess of the mean (referred to as overdispersion), statistical models reflecting this heterogeneity by mixtures of Poisson distributions are frequently employed. Sample size calculation in the planning of such trials requires knowledge on the nuisance parameters, that is, the control (or overall) event rate and the overdispersion parameter. Usually, there is only little prior knowledge regarding these parameters in the design phase resulting in considerable uncertainty regarding the sample size. In this situation internal pilot studies have been found very useful and very recently several blinded procedures for sample size re-estimation have been proposed for overdispersed count data, one of which is based on an EM-algorithm. In this paper we investigate the EM-algorithm based procedure with respect to aspects of their implementation by studying the algorithm's dependence on the choice of convergence criterion and find that the procedure is sensitive to the choice of the stopping criterion in scenarios relevant to clinical practice. We also compare the EM-based procedure to other competing procedures regarding their operating characteristics such as sample size distribution and power. Furthermore, the robustness of these procedures to deviations from the model assumptions is explored. We find that some of the procedures are robust to at least moderate deviations. The results are illustrated using data from the US National Heart, Lung and Blood Institute sponsored Asymptomatic Cardiac Ischemia Pilot study.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Estimation and Outlier Detection for Overdispersed Multinomial Models of Count Data

Robust Estimation and Outlier Detection for Overdispersed Multinomial Models of Count Data We develop a robust estimator—the hyperbolic tangent (tanh) estimator—for overdispersed multinomial regression models of count data. The tanh estimator provides accurate estimates and reliable inferences even when the specified model is not good for as much as half of the data. Seriously ill-fitted counts...

متن کامل

Estimation of Count Data using Bivariate Negative Binomial Regression Models

Abstract Negative binomial regression model (NBR) is a popular approach for modeling overdispersed count data with covariates. Several parameterizations have been performed for NBR, and the two well-known models, negative binomial-1 regression model (NBR-1) and negative binomial-2 regression model (NBR-2), have been applied. Another parameterization of NBR is negative binomial-P regression mode...

متن کامل

Follow up after sample size re-estimation in a breast cancer trial for time to recurrence

In an international trial of premenopausal women with hormone receptor positive operable breast cancer that compares how the timing of surgical oophorectomy and mastectomy affects time to recurrence, we re-evaluated the required sample size near the end of the planned accrual period. We had anticipated that failure probabilities used at the design stage were too high resulting in a loss of powe...

متن کامل

Maximum Likelihood Estimation of the Negative Binomial Dispersion Parameter for Highly Overdispersed Data, with Applications to Infectious Diseases

BACKGROUND The negative binomial distribution is used commonly throughout biology as a model for overdispersed count data, with attention focused on the negative binomial dispersion parameter, k. A substantial literature exists on the estimation of k, but most attention has focused on datasets that are not highly overdispersed (i.e., those with k>or=1), and the accuracy of confidence intervals ...

متن کامل

A simulation study comparing likelihood and non-likelihood approaches in analyzing overdispersed count data

Overdispersed count data are modelled with likelihood and non-likelihood approaches. Likelihood approaches include the Poisson mixtures with three distributions, the gamma, the lognormal, and the inverse Gaussian distributions. Non-likelihood approaches include the robust sandwich estimator and quasilikelihood. In this simulation study, overdispersed count data were simulated under the Poisson ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Statistics in medicine

دوره 32 21  شماره 

صفحات  -

تاریخ انتشار 2013